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The Generalized Polya Urn (GPU) is a popular urn model which 
is widely used in many disciplines. In particular, it is extensively used 
in treatment allocation schemes in clinical trials. In this paper, we 
propose a sequential estimation-adjusted urn model (a nonhomoge- 
neous GPU) which has a wide spectrum of applications. Because the 
proposed urn model depends on sequential estimations of unknown 
parameters, the derivation of asymptotic properties is mathemati- 
cally intricate and the corresponding results are unavailable in the 
literature. We overcome these hurdles and establish the strong con- 
sistency and asymptotic normality for both the patient allocation and 
the estimators of unknown parameters, under some widely satisfied 
conditions. These properties are important for statistical inferences 
and they are also useful for the understanding of the urn limiting 
process. A superior feature of our proposed model is its capability 
to yield limiting treatment proportions according to any desired al- 
location target. The applicability of our model is illustrated with a 
number of examples. 

1. Introduction. 

1.1. Brief history. Urn models have long been recognized as valuable 
mathematical apparatus in many areas including physical sciences, biological 
sciences and engineering [19, 20]. Among various urn models, the Polya urn 
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(also known as the Polya-Eggenberger urn) model is particularly popular. 
It was originally proposed to model the problem of contagious diseases [10]. 
Since then, there have been numerous generalizations and extensions. For 
example, in information science, Martin and Ho [21] apply the Polya urn 
process to study human decision-making. In population genetics, Benaim, 
Schreiber and Tarres [9] make use of a class of generalized Polya urn models 
to scrutinize the evolutionary processes. In economics, the model is employed 
to capture the mechanism of reinforcement learning by Erev and Roth [13] 
and Beggs [8]. In business, a coupled Polya urn model is utilized by Windrum 
[32] to examine the recent browser war between Netscape and Microsoft. 

Urn models are also extensively applied in clinical studies. Applications 
are mostly found in the area of adaptive design in which urn models are uti- 
lized to provide response-adaptive allocation schemes. In most clinical trials, 
patients accrue sequentially. Assume the availability of several treatments. 
Adaptive designs are valuable and ethical randomization schemes that for- 
mulate treatment allocation as a function of previous responses, with one of 
the important objectives to assign more patients to the better treatment. 

Early works in adaptive design can be traced to Thompson [29] and 
Robbins [24] . Athreya and Karlin [3] successfully embed the urn schemes into 
a continuous-time branching process and generate important limit theorems. 
Among many useful allocation schemes generated under the paradigm of urn 
models, the most influential family of models is the Generalized Polya Urn 
(GPU) [also named as the Generalized Friedman's Urn (GFU) in literature] 
[2, 3, 30]. 

A general description of the GPU model is as follows. To compare K treat- 
ments in a clinical study, consider an urn containing particles of K types, 
representing the K treatments. Patients arrive sequentially to be allocated to 
the treatments. At the beginning, the urn contains Yo = (lo.lj • • • > Yo,k) par- 
ticles, where Yo,fc > denotes the number of particles of type k, k = 1, . . . , K. 
At stage i, i = 1,2,..., a particle is drawn from the urn and replaced. If 
the particle is of type k, then treatment k is assigned to the ith patient, 
k = 1, . . . , K, i = 1, 2, After observing a random variable £j & which rep- 
resents the response of the ith patient on treatment k, additional di(k, g, £i t k) 
particles of type g, g = 1, . . . , K, are added to the urn, where di(k, g, &,&) is 
a function of After n splits and generations, the urn composition is 
denoted by the row vector Y n = (Yn,i, ■ ■ ■ i Yh,k), where Y n ^ stands for the 
number of particles of type k in the urn after the nth split. The urn com- 
position in stage n can be characterized by the following recursive formula: 

(1-1) Y n = Y n _i + X n D n , 

where the matrix D n = (d n (k,g,£ n ,k),k,g = 1,...,K) and X n = (X n ,i, • • • , 
An, ii") is the observed result of the nth draw, distributed according to the 
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urn composition at the previous stage. That is, if the nth draw is a type-A; 
particle, all elements in X n are except the feth component X n ^ which is 1. 
Moreover, define H, = (E[di(k,g, £i,jfc)|.Fi-i], k,q = 1, . . . , K), where the sigma 
field Ti is generated by {Yo, Yi, . . . , Yj,Xi, . . . ,Xj}. The matrices Dj and 
Hj are called the addition rules and the generating matrices, respectively. 
These matrices play important roles in the mechanism of the GPU model. 

A GFU model is said to be homogeneous if Hj = H for all i = 1, . . . ,n. 
For instance, if the addition rule Dj is merely a function of the iih patient's 
observed outcome, the addition rules are independent and identically dis- 
tributed (i.i.d.). Consequently Hj = H = EDj are identical and nonrandom. 
However, in clinical studies, the entire history of all previous trials provides 
more information of the efficacies of the treatments. For applications of such 
models, one can refer to Andersen, Faries and Tamura [1] and Bai, Hu and 
Shen [6]. If the addition rules rely on all the previous outcomes, the GPU 
model is no longer homogeneous. Under these circumstances, we will only 
assume that H n converges to a limit H. 

In clinical investigations, a crucial element of the treatment allocation 
process is the number of patients being assigned to a particular treatment. 
Here, we define N n = (N ni i, . . . ,N u> k)- The quantity N n< k is the number of 
times a type-A: particle is selected in the first n stages. In clinical trials, N n & 
represents the number of patients assigned to treatment k among the first 
n patients. Obviously, 



Owing to its clinical applications, the asymptotic behavior of N n is of im- 
mense importance [15]. Athreya and Karlin [2, 3] obtained the asymptotic 
convergence of Y n and N n and conjecture that N n is asymptotically nor- 
mal. This conjecture has not been solved for almost three decades due to 
its mathematical complexity. A limited success was reported by Smythe [27] 
who derived the asymptotic normality of Y n and N n under the assumptions 
that the eigenvalues of the generating matrix H are simple and the GPU is 
homogeneous. Bai, Hu and Zhang [7] considered functional limit theorems 
of Y n and N n for K = 2. Further, Janson [17] established functional limit 
theorems of Y n and N n for the general homogeneous GPU model and the 
asymptotic normality of N n follows. 

1.2. Objectives and organization of the paper. Recently, Bai and Hu [5] 
established the consistency and the asymptotic normality of the GPU non- 
homogeneous model. However, their theorems rely on very stringent assump- 
tions which are not valid in many practical applications (illustrated in Sec- 
tion 2). In this paper, we derive a sequential estimation-adjusted urn model 



(1.2) 




i=l 
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(SEU) and successfully produce important asymptotic theorems related to 
Y n and N n . Our allocation scheme is suitable for many popular clinical sce- 
narios. Because the proposed urn model depends on sequential estimators 
of unknown parameters, we cannot use the techniques of Bai and Hu [5] to 
show the asymptotic properties. Instead, we show the law of the iterated 
logarithm of the estimators first and then prove the strong consistency and 
asymptotic normality of the SEUs by using the results of the law of the iter- 
ated logarithm. Also we obtain the asymptotic properties of the estimators 
of unknown parameters. 

Another important advantage of our proposed SEU model is that it can be 
used to satisfy a pre-specified treatment allocation target. Different targets 
might be of interest in clinical studies [25]. In fact, there is a growing interest 
in target-based designs which are derived with a pre-specified allocation 
target (see, e.g., [11, 12, 16, 22, 23]). Based on some optimality consideration, 
a suitable allocation target, say v, is used. It will be shown later that we 
are able to define an SEU model to ensure that the allocation proportion 
converges to v almost surely. Illustrative examples are given in Section 4. 

To summarize, the major contributions of this paper are: 

(a) to propose a general nonhomogeneous GPU model (SEU) with the 
addition rules utilizing sequential estimations of unknown parameters; 

(b) to show strong consistency and asymptotic normality of N n under 
widely satisfied conditions and obtain a general and explicit asymptotic 
variance of N n ; 

(c) to state the framework of the targeting procedures of SEU when the 
desired allocation proportion is pre-specified based on some optimality con- 
sideration; and 

(d) to illustrate with examples the applications of the SEU model in 
clinical trials. 

The SEU model will be described in Section 2. Asymptotic properties are 
presented in Section 3. Then in Section 4, selected examples are provided 
to illustrate the application of the SEU model in clinical trials. Concluding 
remarks are also given. Finally, technical proofs are stated in Section 5. 

2. Sequential estimation-adjusted urn model. 

2.1. The SEU process. Before the outline of the SEU model, we adopt 
the following assumption on patient responses. Let the responses {{£,j.k, j = 
1, 2, . . .}, k = 1, . . . , K} be K independent sequences of i.i.d. random vari- 
ables, and write ^ = (£j,i, • • • ,£j,K)- In clinical trial, can only be observed 
when the ith patient is assigned to treatment k, that is, when Xj & = 1. Now, 
a useful nonhomogeneous GPU model with addition rules utilizing the entire 
history of the previous trials is given as follows. 
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The SEU process. Let = (6\, . . . , 9k) be the parameter from the dis- 
tribution of the response £ n = (£ n ,i, • • • , £,u,k)- If the value of is known, it 
is natural to consider the adaptive design with addition rules D(0,£ n ) and 
generating matrices H = H(0) = E[D(0,£ n )], whose values depend on the 
parameter 0. Here, the d x d matrix H(x) is a function continuous at point 
0, and also, as a function of x, the matrix D(x,y) = {Dij(x,y)} is continu- 
ous at point for any fixed possible value y of £ n . However, is usually un- 
known in practice; an estimate of is therefore required. Hence, without loss 
of generality, we assume = E[£ n ], with its estimate n = (0 n> i, ■ ■ ■ , 9 n> K), 

where 9 n k= 1+ ^*=i_ Xl ' fc ^' fc k = 1, . . . , K. Here, adding 1 to both the nu- 
merator and the denominator is only to avoid the nonsense case of 0/0. 
We consider the adaptive design with adding rules D n = D(0 n _i,£ n ) and 
generating matrices H„ = H(0 n _i) = E[D n |jF n _i]. 

The above SEU model has generating matrices H n that are not homo- 
geneous. Bai and Hu [4, 5] established the consistency and asymptotic nor- 
mality of centered urn compositions Y n — EY n for a wide spectrum of urn 
models with generating matrices H n satisfying the condition that 

00 II XT TT|| 

, v ^ — ■» ill — n 

(2.1) E % <°°- 

7=1 J 

Here || • || denotes the norm of a matrix or a vector. This condition is in 
general true for practical applications and it is true for the SEU model. 

Now, let us turn to the asymptotic property of N n that is more useful but 
intricate. Janson [17] gave a general result of the limiting distribution of N n 
for homogeneous urn models (cf. his Theorem 3.28). The basic technique 
of Janson [17] is to embed the urn process in a continuous-time branching 
process. The embedding technique was developed by Athreya and Karlin [3]. 
Unfortunately, this method does not work for nonhomogeneous urns. The 
very recent work of Bai and Hu [5] established the asymptotic normality of 
N n — EN n for nonhomogeneous urn models when the generating matrices 
H n satisfy condition (2.1) as well as the following condition: 

(2.2) ||H n -EH n ||=o(n- 1 / 2 ) in L 2 . 

It is beyond doubt that the asymptotic distributions of Y n and N n with 
the centralization factors EY n and EN n being substituted by nv are more 
realistic, because the values EY n and EN n are both unknown and computing 
them is far too complex, if not infeasible. To provide a possible solution, Bai 
and Hu [5] suggested that EY n and EN n can be replaced by nv if condition 
(2.1) is revised to a more stringent condition: 

00 li tr trii 

(2.3) E ^ " <oo. 
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The above condition is too strict for usual applications. For example, condi- 
tion (2.3) is not true for the SEU model. On the other hand, condition (2.1) 
is satisfied with the SEU model because 



H(0 n _ x )-H(0)^^ 



0H(x) 



k=l 



dx k 



(On- 



l,k 



e 



^)=o(n- 1 /4) 



a.s. 



as indicated in Bai and Hu [5]. In fact, conditions (2.2) and (2.3) are not 
satisfied with the SEU model because the fastest almost sure convergence 
rate of ||H n — H|| is 0(yj (log log n) jn ) according to the law of the iterated 
logarithm and the fastest L2 convergence rate is 0(y/n) according to the 
central limit theorem. So, the general theorems of Bai and Hu [5] do not 
apply here. 



2.2. Targeting pre- specified limiting allocation proportions. As explained 
in the Introduction, it is frequent that desired limiting allocation proportions 
are given and our SEU model has the ability to target these proportions. 
First, let us state the well-known results related to the convergence of N n 
and the generating matrix H (see, e.g., [4, 5, 17]). Under suitable conditions 
the allocation proportion converges to a limit, that is, 

(2.4) ^H^v a.s., 



n 



where v = (vi, . . . ,vk) is the left eigenvector of the (limiting) generating 

matrix H with respect to its largest eigenvalue and satisfying v\-\ h vk = 

1, that is, 



(2.5) 



vH = At, 



To target a pre-specified allocation proportion, one needs to define the SEU 
process in such a way to force the sample allocation proportion to the desired 
one as a limit. To illustrate, we will take the two-treatment case with binary 
responses as an example. 

For a binary response (success and failure) clinical trial with two treat- 
ments (1 and 2), Rosenberger et al. [26] proposed the following allocation 
proportion (called the optimal allocation proportion) by considering simul- 
taneously the statistical testing power for comparing the two treatments and 
the failure rate of patients undergoing the two treatments, 



N, 



n,2 



n 



Pi + \fp~2 
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Pi + \/P2 



Other allocation proportions and a general procedure to find an allocation 
proportion according to an optimization criterion had been discussed by 
Jennison and Turnbull [18]. When the SEU model is used to target the above 
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proportion, we need to define an urn model such that v = ( V 1 Ji 2 ) 
is the eigenvector of its generating matrix H with respect to the largest 
eigenvalue. Assume that the expectation of total number of balls added at 
each stage is the same, say 7. Then H can be written in the following form: 

(") ^). 
According to (2.5), we have 

a = V2 = \/V2_ 

Vl y/pl' 

One simple solution of the above equation is a = ^fp~2 and j3 = y/p\. Another 
obvious solution is a = vi and (3 = v\. In either case, the generating matrix 
is a function of p\ and p2- If pi and P2 are known, to implement the alloca- 
tion proportion, we can define an SEU model with addition rules Dj = H. 
Further, when p\ and P2 are unknown, their estimates will be employed as 
instructed in the previous subsection. Examples are provided in Section 4. 

3. Asymptotic properties. In this section we will study the asymptotic 
properties for our SEU model. Major theorems will be given with proofs 
stated in the last section. Owing to the use of estimates of the unknown 
parameters, one will discover that the asymptotic variance is different from 
that given in [5] and [17] and the mathematical techniques are much more 
complex. 

Recall that Y n and N n are defined by (1.1) and (1.2) and H = H(0), 
D n = D(0 n _i,£ n ). To study the asymptotic properties of Y n and N n , the 
following assumptions are required. 

Assumption 3.1. We assume that HI' = 71', where 7 > and 1 = 
(1, . . . , 1). Suppose H has the following Jordan decomposition: 

T- 1 HT = 7 diag[l,J 2 ,...,J s ], 

where J s is a Vt x vt matrix, given by 





1 


. 


• °\ 





A/ 


1 . 


. 








Xt ■ 


. 


u 





. 


. x t ) 



Without loss of generality, we assume that 7 = 1. Otherwise, we can consider 
Y n /7 and D n /7, instead. 
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Let v be the left eigenvector of H corresponding to its maximal eigen- 
value 1 with vl' = 1'. We may select the matrix T so that its first column 
is 1' and the first row of T _1 is v. Let A = max{Re(A2), ■ • ■ , Re(A s )} and 
v = maxj{z/j : Re(Aj) = A}. 

Assumption 3.2. For the addition rules D n = D(0 n _i,£ n ) and the 
responses £ n , suppose Z)jj(x, y) > is continuous at point (0,yo) for each 
yo and there is an r > 2 such that E||£ 1 || r < oo and 

(3.1) supE[||D n || r |.F n _i] < oo a.s. 

n 

Assumption 3.3. For the function H(x), 

H(x) - H = H(x) - H(0) = 0(||x - 0||) as x -» 0. 

Assumption 3.4. For the function H(x), there is a 5 > such that 



K 



K ' ^ dx, 



k=i " Xk 



(x fc -^) + O(||x-0|| 1+<5 ) asx^0. 

x=© 



Remark 3.1. Condition (3.1) is satisfied if we assume that E[||D(x,^ 1 ] 
is a continuous function of x. 



Using the above notation, we shall establish the following results. 

Theorem 3.1. For the SEU, suppose that Assumption 3.1 is satisfied. 
If A < 1, and Assumption 3.2 is satisfied for some r > 1, then 

Yn , N n 

> v and >v a.s. 

n n 

Theorem 3.2. For the SEU, if Assumptions 3.1-3.3 are satisfied, and 
Vk > 0, k = 1, . . . , K , A < 1, then for any n > (1/2) V A, 

n~ K (Y n — rev) — > and n~ K (N n — rev) ^ a.s. 

Theorem 3.3. For the SEU, if Assumptions 3.1, 3.2 and 3.4 are satis- 
fied, and Vk > 0, k = 1, . . . ,K, A < 1/2, then 

n 1/2 (Y n /n- v) 3 N(0,A^) and n 1/2 (N n /n - v) Z N(0, A fl ), 

where N(0,Al) and iV(0,A*) are the normal distributions in 1Z K with mean 
zero and K x K variance-covariance matrices A^ and A", which will be 
specified by (5.24) and (5.26), respectively. 
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Let H = H — l'v and D = 0(0,^). Define S x = diag(v) - v'v, S 2 = 
E[(D - H)' diag(v)(D - H)] , £3 = diag^ia 2 , v K a 2 k ), S 23 = E[(D - H)' x 
diag(v) diag(£i — 0)], where a\ = Var^^), k = 1,2, ... ,K. Covariance- 
variance matrices A* and A* are functions of H, Si, S 2 , S3 and £23 • 

Remark 3.2. If A = 1 / 2 , the asymptotic normalities also hold, but with 
different normalizing, given by n 1//2 log^ 1 / 2 n, and different and A". 

From the proof of Theorem 3.3, one also obtains the following corollary. 

Corollary 3.1. For SEU, if Assumptions 3.1 and 3.2 are satisfied, 
and Vk > 0, k = 1, . . . , K, A < 1, then 

n 1 / 2 ^ - 0) £ JV(0,diag(<7?M, . . . , a 2 K /v K )), 
where a\ = Var(£i ife ), k = l,...,K. 

The next corollary provides results on a special case of SEU. 

Corollary 3.2. Let p(x) = (pk-(x), . . . ,p^(x)) be a vector function be- 
ing twice differentiable at and pfe(x) > 0, h = 1, .. . ,K. Consider an adap- 
tive design with addition rules D n = l'p(0 n _i), that is, pk(® n ~i) balls of 
type k are added to the urn at stage n, k = 1, . . . , K, where n -i is the 
sample estimate of & which is defined as in SEU. Then 

Y N 

>p(0J a.s., >v a.s. 

n n 



and 



Y 



n 



^(^-p(0))^AT(O,2S,), 



N n \ D 



iV(0, diag(v) — v'v + 6S„), 



where 7 = Efc/ ) fc(©), v = p(0)/7 ; 

fdp(@)V , dp(&) 
1 90 j { ' 8® ' 



^ v ~[ d® J 1 (0) d® ' 

l- 1 (0)=diagf^,...,^) 

\Vl V K J 



and v(x) = v^^Vr- Here denotes the matrix { 9p g^ \*=®',i = 1 5 • • • j K, 

i = l,. ..,#}. 
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4. Applications and conclusions. In this section we give some motivating 
examples in order to show the applicability of our SEU model. The first one 
is related to allocation schemes which aim at assigning more patients to the 
better treatment. The remaining two are connected to designs with focus on 
desired targets. 



Example 1 ([6]). Consider a iT-treatment clinical trial with binary re- 
sponses. Let pk be the success probability of a patient on treatment k, and 
let q k = 1 — p k be the failure probability, k = 1, . . . , K. Bai, Hu and Shen 
[6] (BHS Adaptive Design) proposed to add particles proportional to the 
estimated success rates for other K — 1 treatments, when the response is 
a failure on treatment k, that is, at the nth stage, a success on treatment 
k generates a particle of type k, and a failure on treatment k generates 

(M^i-fit-i k) P articles of tyP e 3 for a11 3 / k - Here M n-1 = EjLlPn-l,j, 

Pn-i,j = 7^-~p"p an d Sn-i,j denotes the number of successes of treatment 
j in all the N n —ij trials of the previous n — 1 stages, j = 1, . . . , K. They es- 
tablished the strong consistency of Y n and N n of this model. For this design, 
H(x) = (h kJ (x.),k,j = 1,...,K), where h k ,k(x) = Pk and h k ,j(x) = qk ^ 3 „ 

for k^j. Using Theorem 3.3, we can get the asymptotic normalities. 

Example 2. We consider a two-treatment clinical trial with binary re- 
sponses. Suppose that we want to target the optimal allocation proportion 
proposed by Rosenberger et al. [26]. In (2.6), we choose a = yfp~2, (3 = yjpl 
and 7 = y/p~[ + ■ s /p~2~. The design is defined as follows. At the nth stage, no 
matter what the response of the nth patient is, we add Vp n -i,i particles of 
type 1 and y/p n -i,2 particles of type 2 to the urn. If < pi,P2 < 1, then 



It can be verified that 

S p = diag 

and 



Ql^/Pl <12^/P2 



4(v^T + y/pl) ' 4( v^I + Vp~2~ : 



^ i (pm\ , P\i2 \ ( l -l 

— 77 — . — no . — + 



By Corollary 3.2, 

'Y n i Y n2 \ , — — , 

, — - -> {VPi,VP2), 

n n ) 

fN n>1 iY„, 2 \ ( ypi yp~2~ 



V n ' n J Vv^T+ V^2' + \/P2 



a.s., 
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Example 3. We consider a two-treatment clinical trial with binary re- 
sponses. Suppose that we want to target the allocation proportion (v\ = 
qi q +q 2 > v 2 — qi q +q 2 ) °f the randomized play-the- winner (RPW) rule proposed 
by Wei and Durham [31]. In (2.6) we choose a = v 2 , = v i and 7=1. 
Then H = (1, \)'{v\,v 2 ). The design is defined as follows. At the nth stage, 
regardless of what the response of the nth patient is, we add ^ — f"+j 2 { 2 
particles of type 1 and 
Qn-l,k = 1 —Pn-i,k 



9n-l,l 



fc = 



particles of type 2 to the urn, where 
1,2. If <pi,p 2 < 1, then 



Dfx) 



1 — x 2 



1 — X\ 



(1 - Zl) + (1 - x 2 ) 
1 - x 2 



(1 - o:i) + (1 - x 2 ) 

1 — Xl 



\(1-Xl) + (1 

92 



x 2 ) (1 - xx) + (1 - x 2 ) J 
Qi 



D 



9i + 92 qi + 92 

92 91 

91 +92 9i + 92 



and H(x) = D(x), H = D. It can be verified that Jj p = S„ = qiq 2 (pi + 
P2)/(9i + 9 2 ) 3 (1, -!)'(-!, !)• By Corollary 3.2, 



ii 



V2 



n 





/^n,l Yn,2 




\ n ' n 




/A^l JV n ,2 




\ n ' n 


92 


^n,2 91 


91 + 92 


n 91 + 92 



92 



91 



v 



9i + 92 9i + 92 

92 91 
9i + 92 ' 9i + 92 

7V(0,ct + 2 )(1,-1) 



a.s., 
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and 



n 



1/2 



( 



n 



qi + q2 



where 




2giq2(Pi + P2) 
(qi + q2f 



and <7? 



gig 2 [2 + 5(pi + p 2 )] 
(qi + q2f 



The asymptotic variances are much smaller than those given by the RPW 
rule when q\ + q 2 is near or less than 1/2, since when q\ + q 2 > 1/2, the 
asymptotic variances of Y n) \jn and N n \/n in the RPW rule are 



respectively (cf. [28]). When q\ + q 2 < 1/2, the limiting distributions of Y n 
and N n in the RPW rule are unknown, and the convergence rates of the 
variances are much slower. But, for the adaptive design given here, the 
asymptotic normality can be evaluated according to our formula for any 
0<qi,q 2 < 1. 

Examples 2 and 3 demonstrate the ability of the SEU model to target a 
given allocation proportion. In general, we can use Corollary 3.2 to target 
any desired allocation proportion. This is a useful property of our proposed 
model. 

The examples in this section provide illustrations of how the proposed 
techniques are translated into applications in the formulation of valuable 
treatment allocation schemes. We can also evaluate these schemes by re- 
viewing the respective asymptotic properties of Y n and N„. 

The main contributions of this paper are in proposing the family of se- 
quential estimation-adjusted urn models and showing the two important 
properties of the SEU models: (i) they can be used to target any given allo- 
cation proportions, and (ii) they have certain desired asymptotic properties 
under some widely satisfied conditions. Based on the asymptotic results, we 
are able to derive new designs which have smaller variabilities than some 
traditional and popular urn models (see Example 3). 

In this paper we considered the application of the SEU models in clinical 
trials. The idea of using constantly updated estimators in SEU models is 
novel and the SEU models can be employed in other areas. In fact, there are 
abundant urn model applications where data are collected sequentially and 
the consecutive estimations of unknown parameters provide better solutions 
to the formulated problem. 

For simplicity, the examples considered in this section are all binary re- 
sponses. However, the SEU models can be applied to different types of re- 
sponses. From the definition of the SEU process, the response £ n could be 



Q1Q2 



and 



q 1 q 2 [l+2(p 1 +p 2 )} 



[2( qi +q 2 )-l}( qi +q 2 y 



[2(91 +q 2 ) -\}{qi + q 2 ) 



2 ■ 
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discrete or continuous. For example, we can apply the SEU model to the 
case studied in Section 8 of [12]. In that case, the response is normally dis- 
tributed and the target proportion is the Neyman allocation. Generally, the 
assumptions in Section 3 are satisfied if the third moments of £ and D exist 
and the function H is differentiable. Therefore, the SEU model has a wide 
spectrum of applications. 

5. Proofs. First, we state several relevant lemmas. Recall H = H — l'v. 
Then 

(5.1) T- 1 HT = diag[0,J 2 ,...,J s ]. 



Lemma 5.1. Let B n>n = I and B Uji = Yl^ =i+1 (I + j H). If two 
quences of matrices Q n and P n satisfy Qo = Po = and 

n-l Q fc 

Qn = Pn + 2_\ . H, 

that is, 

Q n = AP n + Q„_ 1 (I + n- 1 H), 
where AP n = P n — P n _i is the difference ofP n , then 

n n— 1 jj 

(5-2) Q n = y AP m B n m = P n + y P m ■ —B n m+l- 

^— ' ' m + l 

m=l m=l 

Also, 

(5.3) ||B nm || < C(n/m) x log u ~ 1 (n/m) for all m = 1, . . . ,n,n>l, 
where logx = ln(x V e) hereafter. 



se- 



PROOF. See Lemma A.l of [16]. □ 



Lemma 5.2. For the SEU, if Assumption 3.2 is satisfied with some 
r > 1, then N n ^ — > oo, k = 1, . . . , K, as n — > oo. Furthermore, n — > 

a.s. whenever E||^ 1 || < oo, and 6 n ^ — Ok = 0(y^^ L ^ g ^"' fc ) a.s. whenever 

E||£il| 2 < ° as oo, k = 1, . . . , K . 

Proof. First, notice that {||D n || — E[||D„|| |.F n _i]} is a sequence of mar- 
tingale differences with 

supE[||D n |||^ n _ 1 ]<sup(E[||D n |n^n-i]) 1/r <cx) a.s. 
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and 

- E[|||D W || - EtllDnlH^-iiri^n-i] < y> C 

n=l n=l 



< oo a.s. 



by condition (3.1). According to the law of large numbers of martingales 
(cf. [14]), 

n n 

Y, ||D m ||=^ E[||D m |||^ m _i] + o(n) = 0(n) a.s. 

m=l m=l 

So, by (1.1), ||Y n || < ||Y || + Em=ill D m|| = 0(n) a.s. It follows that 
Xm=i(Ej Yn,j) = oo a.s. So, for each k = 1, . . . ,K, 

OO OO OO \f 

E ^ X n, k = l\?n-l) = E > E yF^- = - 

n=l n =l^j n >3 n=l^3 ln j 

which, together with the generalized Borel-Cantelli lemmas, implies that 
P(X n k — 1) i-o.) = 1. Then N n f. — > oo a.s., k = 1, . . . , K. By Lemma A. 4 
of [16] again, the lemma is proved. □ 

Now, let |Y n | = Y n l' be the total number of balls in the urn at the 
nth stage, and let M n = J2k=i and m n = J2k=i Am^, where AMj, = 

X fc D fe - E[X fc D fe |J"fc_i] and Am fe = X fe - E[X ft |.Ffc_i]. Then {(m n ,M n ), T n ; 
n > 1} is a martingale sequence. By (1.1), 

n-l y 

(5.4) Y n = Y + M n + £ prAH m+1 . 

m=0 I Ym l 

Note that vl'v = v and vH = 0. By (5.4), it follows that 

n-l , y \ « 

Y n -nv = M n +^ — =L-v H m+1 + £ v(H m -H)+Y 

m=0 V|lm| / m=1 
n-l , y \ n 

= M n+E P^T- V H+ E v (H m -H) 

n-l / v 



(5.5) 



+ E r^" v (H m+ i-H)+Y 
m=0 V|*m| / 

n-l / y \ n 

M «+ E pC^- v H + E V ( H -" H ) 
+ E(^- v )( H -+i- H )+ Y o 

m=0 V|Im| / 



where 



(5.6) 
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n— 1 -y 

= M n +^ — mv — ^ 
^— ' m + 1 

m=0 



^=tv(H.-H) + E(^-)(l-^)H 

m=l m=0 ' ' 7 x 7 

n-l / v 



Y , 

1 m| 



+ E(^V v )( H ™+i- H )+ Y o- 

m=0 



Also by (1.2), 



N n - nv = m n + 



n-l v 
1 m 



,Y I 

m=0 I Im l 
n-l / y 



nv 



m=0 
n-l 



»+E(r^T-v)(i-i'v) 



^E(^T-v)d-l'v) 



m ,„ 

m=0 
n-l 



(5-7) +E^t( 1 -^H)( I - 1,v ) 

m=0 I * ' ' ' 
n— 1 

^ Y m - mv , 
= m n + 2^ -7— 1 - 1 v 

^ n 771+ 1 
m=0 

+ E(^-v)(l-^)(I-l'v). 

m=0 1 1 

Proof of Theorem 3.1. First, note that r > 1, ||Am n || < 2, and ac- 
cording to (3.1), 

(5.8) E[||AM n |r|^n-i]<2 r supE[||D n |n^„-i]:=2 r 77 r <oo. 

n 

By the strong law of large numbers of martingales, we have 

(5.9) M n = o(n) and m n = o(n) a.s. 

On the other hand, by Lemma 5.2 and the continuity of H(x), we have 

(5.10) H„-H = o(l) a.s. 
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Note that Hi' = 0', and Y m /|Y m | — v is bounded. From (5.6), it follows 
that 

11 n_1 / Y \ 

R n l' = £ v(H. - H)l' + ]T mtA " v (H m+1 - H)l' + Y 1' 

(5.11) m=l m=0 ^l Ym l ' 
= o(n) a.s., 

which, together with (5.5) and (5.9), implies that 

|Y„| - n = (Y„ - nv)l' = M n l' + R n l' = o(n) a.s., 

that is, 

IY I 

(5.12) i— H-fl a.s. 

n 

Now, by combining (5.6), (5.10) and (5.12) it follows that 

R n = o(n) a.s., 
which, together with (5.5) and (5.9), yields 

n— 1 

Y m - mv- 

Y n - nv = > — — H + o{n) a.s. 

^ n m + 1 

m=0 

And then by Lemma 5.1, we conclude that 

n— 1 / \ 

Y n — nv = o(n) + (n/m) x \og u ~ 1 (n/m) = o(n) a.s., 

m + 1 

m=l 

that is, 

Y n 

> v a.s. 

n 

Finally, from (5.7) it follows that 

N„ - nv = o(n) + £ + £ f ^ - v) o(l) = o(n) a.s. 

The proof of Theorem 3.1 is complete. □ 

To prove Theorem 3.2, we need the following lemma. 

Lemma 5.3. Under the assumptions in Theorem 3.2, we have 



e n -@ = oU l -^^\ and H„-H = o Jl^ a.s. 
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Proof. Note that by Theorem 3.1, 

n 1 
— >— a.s., k = 1,...,K. 

So, by Lemma 5.2, 



e n , k -e k = ofJ l ^j i ,,. k -L....K. 

The proof is then complete by noting Assumption 3.3. □ 

Proof of Theorem 3.2. First, note that ||Am n || < 2, (5.8) and r > 2. 
By the law of the iterated logarithm of martingales, we have 

(5.13) M n = 0(\/n log log n) and m„ = 0(\/n log log n) a.s. 

By Lemma 5.3 and similarly to (5.11) we have 



a.s. 



= 0{\/n log logn), 
which, together with (5.5), implies that 

|Y«| -n = M n l / + R n l / = 0( v / nloglogn) a.s., 

that is, 



( , 14) ny_ 1=0 ^^ a ., 

And then from (5.6), it follows that 



a.s. 



= 0(y/n log logn). 

With (5.5) and (5.13), we conclude that 
n— 1 -«t- 

Y n — nv = — — H + 0(\/n log logn) a.s. 

^— ' m + 1 

m=0 

which, together with Lemma 5.1, implies that 



A 0(y/m log log m) A 
Y n — nv= (n/m) log {n/m) 



m + 1 

m=l 



a.s. 



( 5 - 15 ) f O( Vnloglogw ), if A < 1/2, 

0{y/n log log n log 1 ' n ) , if A = 1/2, 
0(n A log^n), if A > 1/2. 
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Finally, by (5.7) and (5.13)-(5.15) we conclude that 



N w - nv = 0(v/nloglogn) + ]T ° (I|Y " mv| 

111+1 



n-l 



' log log m 



a.s. 



m 



m=l 

O( Vraloglogn ), if A < 1/2, 

O ( \fn log log n log 1 ' n ) , if A =1/2, 

0(n A log" -1 ra), if A > 1/2. 

Theorem 3.2 is proved. □ 

Proof of Theorem 3.3. Without loss of generality, we assume that 5 
is small enough such that S < 1/2 — A. Define 



s „ dH(x) 
(5.16) f fc = v- 



dx k 



x=e 



fi/vi 



and Q n = (Q n ,i, . . . , Q u ,k) = Em=l AQ m , where AQ. m = X m diag(£ m - 0). 
Then Q n is a martingale since £ m is independent of X m and the cr-field 
T m -\. By Lemma 5.3 and Assumption 3.4, 

v(H n+1 - H) = (0 n - 0)(f/, . . . , f£)' + O(||0 n - 0|| 1+<5 ) 

= (0„ - 0)(f/, . . . ,f£)' + o^" 1 / 2 "* 5 / 3 ) a.s. 

Also, by Theorem 3.2, Q n = 0(y/n log logn) a.s. and = 1+ ^'^'/"'^°'' t ; 
A; = 1, . . . , K, we have 

On,k — Ok = "TP —T — (Qn,k + 1 ~ Ok) 

N njk + 1 n 

11. / n 1 \ 1 1-0* 

— Qn,fc + "Tt TT —Qn,k + 



u fe n ' V.A'n.fc + l Wfc/n ' iV nife + l 

= — -Qn,fc + oK-^CKn-^Vloglogn) + of- 
f fc re \n 

= — -Qn,fc + o(n" 1/2 " 5 ) a.s. 
f fc re 

It follows that 

(5.17) v(H n+1 -H) = iQ„F + o(n- 1 / 2 - 5 / 3 ) a.s. 
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Now, from (5.6), (5.17), Theorem 3.2 and Lemma 5.3 it follows that 

n n—1 

R n = V ( H ™ " H ) + E °(( ? 



Jm 

m=l m=l 
n-l 



I log log m 



, m 

m=l 



■m 



+ ^ o(m- 1+K )0| 

m=l 

fV + ^l a.s. 

So, by (5.5), 



n -I n—1 y 

Y n -nv = M„+ £ —QmF + £ Ym "7 V H 

(5.18) 771 + 1 

+ (n-V2-V3) a . s . 

Let Uo = and 

n-l 



n n—1 Tj 

U n = M n + V — Q m F + V -^-H. 

m „ m + 1 



m=l m=0 

Then 

n—1 y tt 

Y„-nv-U„= £ Ym ~ m ^~ Uro H + oCn 1 /^) a . s . 



By Lemma 5.1, it follows that 



Y re -nv = U n +5>(m- 1/2 - 5 / 3 ) - log 



m=l 



m J \m 



(5.19) 

= U n + o^n 1 / 2 - 5 / 4 ) V (n A log"" 1 n)) a.s. 
On the other hand, by (5.2), 

Un=J2 ( AM m + -QmF)B„, m 



=1 ^ 



n n 71 / 1 \ 

= £ AM m B 

n,m 

+ ]T AQ m F Y, ~ B n,j ■ 

m=l m=l j=m 

Also, by (5.7) and Theorem 3.2, 



n—1 -y n—1 



N n - nv = m n + ^ 'm mv _ lV + ^ 

m=0 m=l 
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= m»+E-% T (I-l , v) 
^— ' m + 1 

m=0 

+ o((n 1 / 2 " 5 / 4 ) V (n A log^ 1 n)) + o^ 1 / 2 "* 5 ) 
n— 1 /n— 1 ^ \ 

(5.20) = m n + ^ AM m ]T —y B 4>m (I - l'v) 

m=l \i=m / 

n—1 /n—1 i -i \ 

+ E A Q™ F E E T^y - d (i - i'v) 

m=l \i=m j=m V /J / 

+ o((n 1 / 2 " 5 / 4 ) V(n A log" -1 ™)) 
:= V n + o((n 1 / 2 " 5 / 4 ) V (n A log^ 1 n)) a.s. 



So, to show the asymptotic normalities of Y n and N n , it suffices to show 
the asymptotic normalities of U n and V n . Write Bj,,™ = Ej= m ~j L > Bn,m = 

E?=™ %f and Bgk = E;= m where E.U+iO = and so B« n = 

0, i = 1,2,3. Then 



U n = ^ (AM m B„, m + AQ m FB$J, 

m=l 

(5.21) 

V n = ( Am ™ + AM m B( l 2 ) ra (I - l'v) + AQ m FBg) m (I - l'v)). 

m=l 

Note that U n and V n are sums of martingale differences. We will use the 
central limit theorem for martingale (cf. [14]) to prove our result. According 
to (5.8), 

1 n 

{E[|| AM m B n , m + AQ TO FBWjn^m-i] 

m=l 

+ E[||Am m + AM m Bg) m (I - l'v) 

+ AQ m FBg) n (I-l' V )|n^ 1 -i]} 
G x-^ I I n \ , „ i n 



(5-22) <^E - !°S - 

m=l ' 



x E[||Am m |r + ||AM m || r + ||AQ ro f|J> 



m— 1 1 



< 4-2 E ((-) V- 1 -Vu + + E\\U\ r } 



m=l 
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a.s. as n — > oo, 



that is, the Lindberg condition is satisfied. Finally, it is enough to calcu- 
late the asymptotic (conditioned) variance-covariance matrices of U n and 
V„. Recall Ei = diag(v) - v'v, S 2 = E[(D - H) / diag(v)(D - H)], £ 3 = 
diag(uicrf,...,^a|), E 23 = E[(D - H)'diag(v) diag(^ x - 0)], where D = 
D(0,^i) and a\ = Var(£j fe), k = 1,2, . . . , K. Notice (5.8) and the continuity 
of the functions D(-,£ n ) and H(-). Then by Theorem 3.1, Assumption 3.2 
and Lemma 5.2, we have 



and 



Vax[AM„|.F n _ 



Dn diag 



I n-l 



+ H' 



H n ) diag 
Y n _i 



Y n _i 



Hi n ~ 1 u 
„ T^T 7 T^Z T-tlr 



™|Y„_i| |Y n _n 



D r 



diag 



Y n _i 

Y n _i Y n _i 



|Y n _x 

^Ss + H'SiH a.s. 
Var[Am„|J" n _i] 



|Y n _i| Y n _ 



H n )|J^ re _i 
H„ 



diag 



n-l 



|Y n _x 

Var[AQ n |^ ft _i] 



'n-l 1 n- 

|Y n _i| |Y n _ 



a.s., 



E[diag(AV(£n,i - #i) 2 , • • .,X n , K ^ n , K - Ok?)^!] 



diag 



|Y n _i| 
Cov[(Am n ,AM n )|^ n _i] 



n-l 



a.s., 



diag 



n-l 



V' V 

1 n—1 x n- 



n-1 



n-l 



|Y n -i| 
— > SiH a.s., 

Cov[(Am ra ,AQ„)|^„_ 1 ] 

= E[(X n - E[X n |^ l _ 1 ]) / X n diag(^ n - 0)|^„_i] 

= 



Cov[(AM n ,AQ„)|.F, 



n-l 
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D^diagf X"" 1 ) diag(£ n - 0)1^-1 



n-l 



£23 a.s. 



So, 
(5.23) 

By noting (5.21) 
we can write 



Var[(Am n , AM n - Am n H, AQ n )|^ n _i] 



Ei 

S2 S23 

s 23 s 3 



a.s. 



HB nm = B nm H and H(I — l'v) = H, 



U n = U«H + Ui 2 )+U( 3 ) 



and 



where 



V n = V« + Vi 2 ) (I - l'v) + Vf (I - l'v) , 

n 

U« = J2 Am m B n , m , 

m=l 
n 

U<?> = (AM m - Am n H)B n , m , 

m=l 

= £ AQ m FB« m , 

m=l 
n n— 1 

V« = Am - + £ Am m HB( 2 ) ra (I - l'v) 

m=l m=l 
n-l 

= Am n + £ Am^B^H + 1) 

m=l 
n-l 

= Am n + ]T Am m B n , m = u£\ 

m=l 

n 

Vf = X: (AM m - Am m H)B( l 2 ) n , 

m=l 

n 

V(f) = £ AQ m FBg) n . 



m=l 
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Var(U«|)=£E[(Am m B 

n,m) Am m B n)?Tt |.7 : " m— l] 

m=l 
n 

= £E; >m (£i + (l))E n , m 

m=l 

m=l 



2^-2 



71 



1 /l\H 



1/n \X 



H 



+ o(n) 

J fi^ Ei ^i^j da; + o(n) := n\\ + o(n) a.s., 



where a H is defined to be e Hlna = J2^=o ^jf H J . Here we use Var(-|) to 
denote the sum of conditional variance-covariance matrices of related mar- 
tingale differences, and Cov{-,-|} is defined similarly. Also, 



Var(ljK 2 )|) = XX im (£ 2 + o(l))B r 



m=l 



n 



Vi\ H _ /l 



o \x 



H 



cte + o(n) := tiAJj + o(n) a.s., 



Var(U^|) = ^ (Bg^VCEs + o(l))FB( ] 



(i) 



m=l 









[/"-(-)"<* 


F'£ 3 F 








Ux y\yJ J 



+ o(n) 



1/1 



- - dy 

x y\y 



H 



F'E S F 



1 (\ 



- - 

x y\y 



H 



= n dx 
Jo 

:= nAj + o(n) a.s., 

n 

Cov{U«,u( l 2 )|}= ]T E[(Am m B nim ) / (AM m -Am n H)B nim |^ m „ 1 



+ o(n) 



m=l 



= o(n) a.s., 
Cov{U«,Uf |} = a.s. 
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Cov{U( 2 ),U( 3 )|}= £ B / nim (S 23 + o(l))FB( 1 ) 



m=l 



n / n \ H 

dx(-\ £ 23 F 



= nj\x(^j £ 23 F 
:= nA 23 + o{n) a.s. 



n l/n\ H 
- - dy 

x y\yj 



l fl 



- - rfy 

Ux y\y 



H 



+ o(n) 
+ o(n) 



Note that ||a H || < Ca x \og u 1 a for a > 1. The above integrals are well de- 
fined. It follows that 

(5.24) n _1 Var(U„|) -^H'A|h + A 2 + A^ + A 23 + (A 23 )' :=A f . 
And then by the central limit theorem for martingales (cf. [14]) 



(5.25) 
Similarly, 



r? 



^U^iV^At). 



Vax(V^I) = Var(U«|) = nA\ + o(n) a.s., 

n 

Var(vi 2 )|)= ^(Bg) n )'(S 2 + (l))Bg) n 



m=l 



dx 



n i (y 



H 



- - 

x y \x ' 



H 



= n / <ix / - 1 — I dy 

Jo Ux y\xj 

:= nA 2 + o(n) a.s., 

n 

Vai(V,(j» |) = Y. (Bgj'F'(E 3 + »(1))FB< 3 > 



x y\x 



+ o(n) 
+ o(n) 



m=l 



dx 

x F'£ 3 F 

l 

n I dx 



dy I du — (— 

J x yu \u 



Hi / 



dy ( du — [ — 

J x yu\u 



Hi 



+ o(n) 



o 



dy [ du — \ 

x Jx yu \u, 
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dy [ du — [ — 

J x yu\u 



+ o(n) 



:= nA' + o(n) a.s., 
Cov{v( 1 ),v( 2 )|} = o(n) a.s., Cov{\£\ \^\} = a.s. 
and 

Cov{V( 2 ),V(?)|} 



m=l 



dx 



H 



n I dx 

o 



Jx y\x 



dy 



F'X^F 



}_(y_ 

yu\u 



dy 



F'X^F 



r-n ry 

/ dy du 

Jx Jx 

dy I du — i — i 



X J X 



yu\u 



+ o(n) 
+ o(n) 



ix y \x 

:= nA 23 + o(n) a.s. 
It follows that 

n- 1 Var(V n |) -» A{ + (I - v'1)A|(I - l'v) 

(5.26) + (I - v'l)(A 3 + A» 3 + (A| 3 )0(I - l'v) 

:= A tt a.s. 

And then by the central limit theorem for martingales (cf. [14]), 

(5.27) r^V^JVfO.A"). 

Combining (5.19), (5.20), (5.25) and (5.27), we complete the proof of The- 
orem 3.3. □ 

PROOF of Corollary 3.1. Note that Q n = 0{n 1 / 2 ) in L 2 . By Theo- 
rem 3.1, we have 



&n,k — Ok — Qn,k + 

v k n 



n 



1 1 

Vk n 
1 1 

On the other hand, 



N n> k + 1 Vk J n 
Qn,k + o{l)0{n~ l l 2 ) + o(- 



^ ' N n . k + 1 



Qn k + o(n x / 2 ) in probability, k = 1, . . . , K. 

Vk n 
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The result follows. □ 



Proof of Corollary 3.2. Now, HI' = 7, we shall consider Y„/7 
instead of Y n , and D (x) /7 instead of D (x) . Notice that D (x) /-y = H (x) = 
l'p(x) /7, and then S 2 = S 23 = 0. It follows that A ] 2 = = = A 23 = 0, 
and then = (EL/j)' A\(U/^) + \\ and_A» = Aj + (I - v'l)A 3 (I - l'v) by 
(5.25) and (5.26). Also, H/7 = l'v and H = H/7 - l'v = 0. Thus 



^ Jo \x) 



dx = J}\ = diaff(v) — v v, 



(H'/7)A t 1 (H/ 7 )=0, 



dx 



l dy 



and 



dx 



1 



1 dy p> du 



x y Jx u 



(F/7)'S 3 (F/7)=2(F/7) , S 3 (F/7) 



(F/7)'S 3 (F/7) 



-(lnx) 4 (F/ 7 )'S 3 (F/7) dx = 6(F/ 7 )'£ 3 (F/ 7 ). 
4 

Notice that v<9H(x)/<9xfc = dp(x)/dxk- It is easy to see that 



7 2 V 90 / 



50 



Hence A f = 2S p / 7 2 and A s = Aj + 6(1 - v'l)£ p (I - l'v)/ 7 2 . By Theorems 
3.1 and 3.3, we conclude that 



Y„/ 7 



ct.S. « 



11 



n 



a.s. 



and 



Finally, notice that p(x) =p(x)l'v(x). We have 



v^(^ - v) 3 N(0, 2S p / 7 2 ), v^(^ - v) 3 iV(0, A»). 



gp(g) 

00 



I - l'v) 



9p(©)l' 



90 



v(I - l'v 



dv(0) 



(I - l'v) 



O + 7 



3v(0) 5v(0)l' dv(0) 



50 



50 



-v = 7- 



90 



The proof is now complete. □ 
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Remark 5.1. One can also show that n 1 Cov{U n , V n |} — > A™ a.s. for 
some A tB . So, 

nV 2 (Y n /n-v,-N n /n-v)ZN(o, (^J^ t A « 

Actually, with a more careful but complex proof similar to that in [16], one 
can show that the process n~ l / 2 (Yy nt ^ — [nt]v, N[ nt ] — [ni]v) is weakly conver- 
gent to a 2i^-dimensional Gaussian process (Gi(i), G2(i)) in the Skorohod 
topology, where G±(t) is the solution of 

dd(t) = (IWi(t)H + dW 2 (t) + W3 ® Fdt + ^^-TLdt, 

Gi(0)=O, 

(Wi(t),W2(t),W3^)) is a 3-KT-dimensional Brownian motion with variance- 
covariance matrix defined in (5.23) and 

* Gi(5) 



G 2 (t)=W 1 (t)+ / -^ds(I-l'v) 

JO s 



Furthermore, we also have 

n" 1 Cov{Q n , U n |} - (S 23 + E 3 )(I 

and 



H) 1 



a.s. 



n^ 1 Cov{Q n , V n |} - (£ 23 ' + E 3 )(I - H)- 1 ^ - l'v) a.s. 

This shows that the joint distribution of n 1 / 2 (@ n — 0), n 1 / 2 (Y n /n — v) 
and n 1 ' 2 (N n /n — v) converges to a mean-zero 3iT-dimensional Gaussian 
distribution. 



If A = 1/2, one also can show that (5.22) holds with (ralog 



2u-l n )l+<5/2 

replacing n r > 2 , and that (nlog 2 ^ 1 ra) _1 Var{(U„, V n )|} converges to a 2d x 
2d matrix. This implies that the asymptotic normalities of Y„ and N n also 
hold. 

Remark 5.2. A^ and A" can be derived. As an example, we provide 
details to derive the second part of A", that is, (I — v'l)A 2 (I — l'v). Other 
similar terms can be derived in the same fashion. Write T = (1', T2, • ■ • , T s ) 
and T*(I-v'l)Al(I-l'v)T = C£ gh ,g,h = l,...,s). Note that 

t*a 2 t 



dx 



dx 



x y \x 
1 

diag 



diag[0,J2 



y \x 



dy 



.1 



T*£ 2 T 
J 



1 ify 



diag[0,J2,...,J s 



y \x 



Ux y \x, 

dy T*E 2 T 



dy 



diag 



ify 



y \x 



y \x 



dy 
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So, 



T*(I- v'l)A 2 tt (I- l'v)T 

= diag(0, 1, . . . , l)T*A«Tdiag(0, !,...,!) 









dx 


/o 




X 









diag 



y \x 



diag 



Also, 



where 



■ 



y \x 
r J 



y \x 



•i 



dy 



dy 



T*£oT 



a=0 







1 


... 


°\ 






f° 





1 . 


• °\ 










1 ... 















. 


. 


Jt = 










J 


T 2 - 


















... 


1 












. 


. 









... 


o) 











. 


• o) 


11 = 0. 


For 


9, 


h = 2,. 


.,s, 




= = 


o, 









J gh 



dx 



lx y \x 

and the (a, 6)-element of ^ g h is 

a-l 6-1 

2J X [1 , gSiT/ l ] a _ a ' ) 6_6' 



T*£ 2 T h 



x y\xj 



a'=0 6'=0 



'0 



- 1 (y/x) 5 



log a [l)dy 



Hy/x) Xh yfy\ 



V\y 



x J 



dy 



dx 



a-l 6-1 
a'=0 6'=0 



E^( 1 -A 9 )- (o '- m) (l-A 9 -A,)-^+ 1 ) 



1=0 



where [T*S 2 T fe ] a / ;6 / is the (a', &')-element Q f the matrix [T*S 2 T /l ]. 
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